June 5, 2008. To be submitted to ApJ. 

Preprint typeset using I^T^X style cmulateapj v. 10/09/06 



PROBING REIONIZATION WITH THE 21 CM-GALAXY CROSS POWER SPECTRUM 

Adam Lidz 1 , Oliver Zahn 2 , Steven R. Furlanetto 3 , Matthew McQuinn 1 , Lars Hernquist 1 , Matias 

Zaldarriaga 1-4 

June 5, 2008. To be submitted to ApJ. 

ABSTRACT 

The cross-correlation between high redshift galaxies and 21 cm emission from the high redshift 
intergalactic medium (IGM) promises to be an excellent probe of the Epoch of Reionization (EoR) . 
On large scales, the 21 cm and galaxy fields are anti-correlated during most of the reionization epoch. 
However, on scales smaller than the size of the H II regions around detectable galaxies, the two fields 
become roughly uncorrelated. Consequently, the 21 cm-galaxy cross power spectrum provides a tracer 
of bubble growth during reionization, with the signal turning over on progressively larger scales as 
reionization proceeds. The precise turnover scale depends on the minimum host mass of the detectable 
galaxies, and the galaxy selection technique. Measuring the turnover scale as a function of galaxy 
luminosity constrains the characteristic bubble size around galaxies of different luminosities. The cross 
spectrum becomes positive on small scales if ionizing photons fail to escape from low mass galaxies, 
and these galaxies are detectable longward of the hydrogen ionization edge, because in this case some 
identifiable galaxies lie outside of ionized regions. LOFAR can potentially measure the 21 cm-galaxy 
cross spectrum in conjunction with mild extensions to the existing Subaru survey for z = 6.6 Lyman- 
alpha emitters, while the MWA is slightly less sensitive for detecting the cross spectrum. A futuristic 
galaxy survey covering a sizable fraction of the MWA field of view (~ 800 deg 2 ) can probe the scale 
dependence of the cross spectrum, constraining the filling factor of H II regions at different redshifts 
during reionization, and providing other valuable constraints on reionization models. 

Subject headings: cosmology: theory - intergalactic medium - large scale structure of universe 



1. INTRODUCTION 

Detecting 21 cm emission from the high redshift inter- 
galactic medium (IGM) will potentially revolutionize our 
understanding of cosmic reionization. The 21 cm signal 
promises direct, three-dimensional information regarding 
the state of the IGM during reionization (e.g. Scott & 
Rees 1990, Madau et al. 1997, Furlanetto et al. 2006a). 
Unfortunately, experimental challenges are substantial. 
In particular, astrophysical foregrounds are expected to 
be four to five orders of magnitude larger than the signal 
from the high redshift IGM. However, known foreground 
contaminants are spectrally smooth and should be dis- 
tinguishable from the high redshift 21 cm signal itself 
(Zaldarriaga et al. 2004). 

Given that we anticipate observational complications, 
it is important to develop diagnostics to confirm that the 
detected 21 cm signal indeed originates from the high 
redshift IGM. One such approach is to measure the cross 
correlation between 21 cm emission and a high redshift 
galaxy survey (Furlanetto & Lidz 2007, Wyithe & Loeb 
2007). Since most of the anticipated foregrounds come 
from low redshift - primarily galactic synchrotron - and 
not from high redshift galaxies, the mean 21 cm-galaxy 
cross power spectrum signal is largely immune to fore- 
ground contamination (Furlanetto & Lidz 2007) . Detect- 
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ing a 21 cm-galaxy cross correlation should hence con- 
firm that the detected 21 cm signal comes from the high 
redshift IGM. Moreover, continuing efforts are pushing 
galaxy surveys towards higher redshifts, and it is natural 
to consider the information that may be gleaned from 
combining galaxy and 21 cm surveys. Detecting galaxies 
at very high redshift is extremely challenging (e.g. Stark 
et al. 2007, Bouwens et al. 2008), but we will show here 
that a cross spectrum detection may already be possible 
with modest extensions to the Subaru survey (Kashikawa 
et al. 2006; see also Wyithe & Loeb 2007, Furlanetto & 
Lidz 2007). 

In addition to these important practical advantages, 
the 21 cm-galaxy cross correlation is potentially sensitive 
to the size and filling factor of H II regions, the dumpi- 
ness of the IGM, and the nature of the ionizing sources. 
The 21 cm-galaxy cross correlation will also provide a 
more direct tracer of the interplay between the reioniz- 
ing sources and the surrounding IGM, than the 21 cm 
auto power spectrum. The 21 cm-galaxy cross correla- 
tion should hence provide a unique and powerful probe 
of the Epoch of Reionization (EoR) and early structure 
formation. Here we follow up on earlier work by Wyithe 
& Loeb (2007) and Furlanetto & Lidz (2007), and focus 
on modeling the 21 cm-galaxy cross power spectrum, and 
exploring the insights that future surveys will provide re- 
garding the EoR. 

The outline of this paper is as follows. In §2 we estab- 
lish notation, describe the reionization simulations used 
in our analysis, and examine the basic simulated signal. 
We then characterize (§3) the dependence of the cross 
spectrum on redshift and ionization fraction. In §4 we 
illustrate how the signal is sensitive to the properties of 
the ionizing sources. In §5 we describe its variation with 
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the abundance of Lyman-limit systems. We then exam- 
ine the signal's dependence on the way in which high 
rcdshift galaxies are selected (§6), contrasting the results 
for Ly-a selected galaxies with galaxies selected through 
other techniques. In §7 we briefly discuss the statistical 
power of future surveys to constrain reionization through 
measurements of the 21 cm-galaxy cross power spectrum. 
Finally, we summarize our main results and conclude in 
§8- 

Throughout we consider a ACDM cosmology param- 
eterized by: n s = 1, cr 8 = 0.8, Q m = 0.27, Oa = 0.73, 
Slf, = 0.046, and h = 0.7, (all symbols have their usual 
meanings), consistent with the WMAP constraints from 
Spergel et al. (2007) and Komatsu ct al. (2008). 

2. THE 21 CM-GALAXY CROSS POWER SPECTRUM 

In this paper we focus on the cross power spectrum 
between the 21 cm and galactic abundance fields. Each 
field is non-Gaussian and so the cross and auto power 
spectra alone provide an incomplete description of the 
fields' statistical properties. However, the limited sensi- 
tivity of first generation 21 cm surveys will prohibit de- 
tailed imaging of the 21 cm field (McQuinn et al. 2006), 
and we expect these surveys to have relatively low signal 
to noise for detecting higher order moments of the 21 
cm field. We hence focus on the cross power spectrum 
throughout. 

In order to explore the information content of the 
21 cm- galaxy cross power spectrum, it is useful to de- 
compose the signal into the sum of several contributing 
terms. Throughout this work we adopt the limit that the 
spin temperature of the 21 cm transition is much higher 
than the CMB temperature globally (Ciardi & Madau 
2003, Pritchard & Furlanetto 2007), T s » T CM b, and 
we ignore peculiar velocities - which should be a good 
approximation during most of the reionization epoch 
(Mesinger & Furlanetto 2007a). With these assumptions 
the 21 cm-galaxy cross power spectrum can be written 
as: 

AL gal (fc) - A2 lgal (fc)/T = (x m )[Al gal (k) 

+A2 gal (fc) + A^ gal (fc)]. (1) 

In this equation ga i(fc) denotes the cross power spec- 
trum between the 21 cm field and the galaxy ovcrdcnsity 
at wavenumber k = |fe|. The 21 cm field at spatial posi- 
tion r is given by 5 T (r) = T (x H ) (1 + S x (r)) (1 + 8 p (r)); 
T is the 21 cm brightness temperature, relative to the 
CMB, at the redshift in question for a fully neutral gas 
element at the cosmic mean density, and (xh) is the 
volume-averaged hydrogcnic neutral fraction. The field 
§x(r) = {xn{r) — (xh))/(xh) is the fractional fluctu- 
ation in the neutral hydrogen fraction, while S p is the 
fractional gas density fluctuation. Similarly S g (r) = 
(n g (r) — (n g ))/(n g ) is the fractional fluctuation in galaxy 
abundance, where n g (r) specifies the co-moving num- 
ber density of galaxies at spatial position r, and (n g ) 
denotes the volume-averaged galactic abundance. Our 
notation labels the dimensionless cross spectrum of two 
random fields, a and 6, by A ab (fc) = k 3 P a , h (k) / (2n 2 ) 

and A£ gal , for example, is shorthand for the cross power 
spectrum between S x and S g (and P aj b is the usual di- 
mensionful cross spectrum). We use a similar shorthand 
for A pgal (fc) and A^ pgal (fc). Throughout we work with 



the power spectrum of the dimensionless field 8t(?)/Tq 
which we denote by Ai^ ^(fc), which is distinguished 

from the dimensionful power spectrum A^ ga i(&) by the 
factor of To as in Equation 1 . 

The individual terms contributing to the 21 cm-galaxy 
cross power spectrum have the following physical inter- 
pretations. The first term, A^ gal (k), represents the cross 
power spectrum between the neutral hydrogen fraction 
and galaxy density fields. The second term, A pgal (fc), is 
the cross power spectrum between the matter and galaxy 
overdensity fields. The final term, A^ pgal (fc), is a three 
field term which would vanish if each contributing field 
were Gaussian. In fact, we show below that this term is 
generally significant during reionization (see Lidz et al. 
2007a for discussion of a similar term, A?, (ft), which 
contributes to the 21 cm auto power spectrum). We will 
sometimes refer to these terms respectively as the 'x-gal', 
'p — gal', and 'three-field' terms. Let us examine each 
contributing term from our reionization simulations. 

2.1. Reionization Simulations 

First, we briefly describe the two types of reioniza- 
tion simulations used in this work. The first type are 
the reionization simulations of McQuinn et al. (2007b). 
In these simulations, radiative transfer is treated in a 
post-processing stage using the code of McQuinn et al. 
(2007a), a refinement of the Sokasian et al. (2001, 2003) 
code, which in turn uses the adaptive ray-tracing scheme 
of Abel & Wandelt (2002). The radiative transfer calcu- 
lation is performed on top of a 130 Mpc/h, 1024 3 particle 
dark matter simulation run with an enhanced version of 
Gadget-2 (Springcl 2005). The minimum resolved halo 
in this simulation is ~ 1O 1O M0, but smaller mass halos 
down to the atomic cooling mass (Barkana & Loeb 2001), 
M coo \ ~ 10 8 M©, are incorporated with the appropriate 
abundance and clustering as in McQuinn et al. (2007a). 
Ionizing sources are placed in simulated halos with sim- 
ple prescriptions. In our fiducial model, we assume that 
a source's ionizing luminosity is proportional to its host 
halo mass. We assume that gas directly traces the dark 
matter, which should be a good approximation on the 
large scales of interest here. 

Second, we use an improved version of the hybrid simu- 
lation technique of Zahn et al. (2007) , which is essentially 
a Monte-Carlo implementation of the analytic model de- 
veloped by Furlanetto et al. (2004). This technique has 
the advantage of being extremely fast, while maintaining 
accuracy. In comparison to the scheme described in Zahn 
et al. (2007), our present scheme is improved in several 
ways. First, we use 2nd-order Lagrangian Perturbation 
Theory (2LPT) to generate realizations of the density 
field (Crocce et al. 2006) during reionization (as in Lidz 
et al. 2007b), rather than generating Gaussian random 
fields. This allows us to incorporate quasi-linear effects. 
Next, we use a scheme similar to that of Mesinger & 
Furlanetto (2007a) to predict the halo distribution from 
an initial, linear displacement field. 

2.2. Basic Simulated Signal 

Let us now examine the main features of the simu- 
lated signal. To begin with, we consider the McQuinn 
et al. (2007b) simulations, and focus on a model in 
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Fig. 1. — Simulated maps of the density, halo, ionization, and 21 cm fields. Each map is 130 Mpc/h on a side and is drawn from a 
simulation snapshot at z = 7.32 at which point (xi) = 0.54 in our model. The density, ionization, and 21 cm maps are each 1-cell thick 
(0.25 Mpc/ft), while the halo field is from a 60-cell (15 Mpc//i) wedge. On large scales, the bright regions in the overdensity map tend to 
have more halos, be ionized, and be dim in 21 cm. The correspondence between the bright regions in the halo field, and the dim regions 
in the 21 cm field, is the signal we characterize and quantify in this paper. 



which all halos down to the atomic cooling mass con- 
tain sources with an ionizing luminosity proportional to 
host halo mass. Further, we assume that all halos above 
-Wg,min = 10 10 Af Q contain galaxies detectable by our hy- 
pothetical survey. In what follows, this prescription for 
the ionizing sources and the minimum detectable host 
halo mass constitutes our fiducial model. We denote 
the minimum detectable host halo mass by M gjmm , and 
the minimum host halo mass for the ionizing sources by 
Mx.min- Presently we consider a simulation snapshot at 
z = 7.32, at which point the filling factor of ionized re- 



gions in our model is (xi) — 0.54. 

It is illuminating to inspect the simulated fields visu- 
ally before calculating their detailed statistics. In Figure 
1 we show narrow slices through our simulated density, 
halo, ionization, and 21 cm fields. Here one can clearly 
see that the bright regions in the halo map correspond 
to dim regions in the 21 cm map, while dim regions in 
the halo map correspond to bright regions in the 21 cm 
map. This anti- correlation is the signal we characterize 
and calculate in the present paper. As one can see from 
the panels of Figure 1, the anti-correlation arises because 
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Fig. 2. — The 21 cm-galaxy cross power spectrum and its con- 
stituent terms. The signal is shown for our fiducial model at 
(xi) = 0.54. Top panel: The absolute value of the 21 cm-galaxy 
cross power spectrum (black solid line). The blue dashed line shows 
the x-gal cross power spectrum, the magenta long-dashed line shows 
the p — gal cross power spectrum, and the green dotted line shows 
the three-field term. On small scales the three-field and p—gal cross 
power spectra cancel each other out rather closely. For contrast, 
we also show the cross power spectrum between the neutral hydro- 
genic fraction and the density field (cyan dot-dashed line) . Bottom 
panel: The cross correlation coefficient between the 21 cm and 
galaxy fields as a function of wavenumber. The cyan dot-dashed 
line indicates the cross-correlation coefficient between the neutral 
hydrogenic and density fields. The red dotted line indicates zero 
correlation coefficient. The sign of the signal in the top panel can 
be inferred from the correlation coefficient shown here. 

galaxies are more abundant in large scale overdense re- 
gions, which hence ionize before typical regions. As a 
result, the overdense regions contain less neutral hydro- 
gen during reionization, and emit more dimly in 21 cm 
than typical regions, while containing more galaxies (see 
also Wyithe & Loeb 2007). 

In order to quantify these visual impressions, we calcu- 
late and show the 21 cm-galaxy cross power spectrum in 
Figure 2. The top panel shows the absolute value of the 
21 cm-galaxy cross power spectrum, as well as the indi- 
vidual terms of Equation (1). The bottom panel shows 
the cross correlation coefficient between the two fields, 

1/2 

r(k) = P2i,gal(fc)/[-P2i(fc)-Pgal(fe)] • In estimating the 
cross-correlation coefficient here and throughout this pa- 
per, we subtract shot-noise from the galaxy power spec- 
trum (before calculating r(k)) assuming that it is Poisson 
- i.e., we assume P s hot = 1/^gab where n ga i is the abun- 
dance of halos above M g m i n . 

The figure reveals several interesting features of the 
signal. On large scales the 21 cm field is anti-correlated 
with the galaxy field. As explained and visualized in Fig- 
ure 1, this occurs because galaxies form first, and ionize 
their surroundings, in overdense regions. On the other 
hand, on small scales the 21 cm and galaxy fields are 
roughly un-correlated. We can understand this by exam- 



ining the small-scale behavior of the constituent terms, 
as shown in the top panel. The cross power spectrum 
between neutral hydrogen fraction and galactic density 
(A^ gal (fc), the x-gal term) turns over on small scales, 
as indicated by the blue-dashed line. This behavior is 
naturally similar to that of the density-ionization cross 
power spectrum, which turns over on scales smaller than 
the size of the H II regions during reionization (Furlan- 
etto et al. 2004, Zahn et al. 2007). The correlations 
die off on sub-bubble scales because the entire interior 
of each H II region is highly ionized, irrespective of the 
interior density and galaxy fields. For comparison, wc 
additionally plot the cross power spectrum between neu- 
tral hydrogen fraction and matter density. This resem- 
bles the cross power spectrum between neutral hydrogen 
fraction and galactic density, but it turns over on slightly 
smaller scales. As we explore further in §4 and §6.1, the 
turnover is on smaller scales owing to ionized bubbles 
around low mass halos, which host galaxies below the 
detection threshold of our hypothetical galaxy survey. 

The cross power spectrum between the density field 
and the galaxy field is shown by the long-dashed magenta 
line. Note the very strong clustering of these rare galax- 
ies: the cross power spectrum has an amplitude of unity, 
A p, g ai( fc ) ~ 1, on a scale of k ~ 1.8ft Mpc -1 . On the 
same scale the amplitude of the matter power spectrum, 
A pp (fc), is a factor of > 7 smaller. Hence even though 
dark matter clustering is quasi-linear on relevant scales, 
the clustering of detectable host-halos may be quite non- 
linear on the same scales. 

Finally, let us examine the three-field term, A^ pgal (fc). 
This term is negative in our calculations and appears to 
closely cancel out the p — gal cross power spectrum on 
small scales. Owing to this cancellation, the shape of the 
21 cm-galaxy cross power spectrum closely mimics that 
of the x-gal cross power spectrum. The 21 cm-galaxy 
cross correlation may then offer a relatively direct tracer 
of bubble growth during reionization: it traces the x-gal 
term, which turns over on scales smaller than that of 
the H II regions around the minimum mass detectable 
galaxies. We examine this further in §3, but we first 
pause to consider the three-field term more closely. 

In order to understand why the three field and p — gal 
terms cancel each other on small scales, it is helpful to 
combine the two terms into a single one, and consider 
the two-point correlation function rather than the power 
spectrum. Here we use similar reasoning to that of Lidz 
et al. (2007a); see their §3.2. The two terms are com- 
bined as: 

A* gal (fc) + A^ gal (fe) oc F.T. [(x(l)£ p (l)n g (2))] . (2) 

Here 1 and 2 indicate spatial positions X\ and X2, re- 
spectively, while F.T. refers to a Fourier transform. 
This equation follows from expanding a;(l) on the right- 
hand side of the equation as x(l) = (x)(l + S x (l)), 
and using ([I + 5 X {1)] S p (l)n g (2)) = (S p (l)n g (2)) + 

(4(i)£ P (iK(2)>. 

We can write the above two-point function as: 
(x(l)5 p {l)n g (2)) = J dx(l)dS p (l)dn g (2) x 

x(l)S p (l)n g (2)P [x(l),S p (l)\n g (2)} P [n g {2)] . (3) 

Provided we consider separations much smaller than the 
size of the H II regions, a pair of points (1) and (2) will 
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mostly be either each within the same ionized bubble, or 
both outside an ionized bubble. If each point is within a 
bubble then the pixel at position (1) is ionized, x(l) = 0, 
and this gives no contribution to the two-point function 
of Equation (3). On the other hand, spatial points out- 
side of bubbles do not contain detectable galaxies in this 
model (although see §4 for alternate cases), n s (2) = 0, 
and again yield vanishing contributions to the two-point 
function of Equation (3). The two-point function of 
Equation (3) must hence vanish on small scales in our 
fiducial scenario, which explains the cancellation between 
the p — gal and three-field terms seen in Figure 2. The 
ionization field is a 'mask' that surrounds each galaxy in 
this model, eliminating the two-point function of Equa- 
tion 3 on small scales. 

2.3. Hybrid Simulations 

In addition to the full radiative transfer simulations of 
McQuinn et al. (2007a, 2007b), we perform some cal- 
culations with the rapid hybrid scheme of Zahn et al. 
(2007), Mesinger & Furlanetto (2007a). We use hybrid 
simulations with two different boxsizes in this work: one 
has £box = 70 Mpc/ft,, while the other has Lbox = 130 
Mpc/h. The density, halo, ionization, and 21 cm fields 
in each simulation are tabulated on 512 3 grid cells. In 
order to locate the halos using the scheme of Mesinger 
& Furlanetto (2007a), we employ a grid of 1200 3 cells in 
each simulation. The smaller box calculation has higher 
resolution - resolving halos down to M m i n ~ 10 8 Af Q - al- 
lowing us to accurately identify halos with mass around 
the atomic cooling mass. The larger box has coarser 
mass resolution, M min ~ 10 9 M Q , but better captures 
the small-A: 21 cm-galaxy cross spectrum. We refer the 
reader to Zahn et al. (2007), Mesinger & Furlanetto 
(2007a) for a detailed description and tests of the hybrid 
scheme. Here we briefly show that estimates of the 21 
cm-galaxy cross power spectrum from our hybrid simula- 
tions agree well with those from the full radiative transfer 
simulations. 

In order to do this comparison, we use the initial con- 
ditions from the McQuinn et al. (2007b) N-body sim- 
ulation, and generate the halo field and ionization field 
using our hybrid scheme. For the purposes of this com- 
parison, in each of our hybrid and radiative transfer 
calculations, we include ionizing sources only in halos 
that are well resolved by the N-body simulation, with 
Mx.min > 8 x 1O 9 M . That is, here we do not add low 
mass halos into the radiative transfer simulation with 
the appropriate statistical properties as in McQuinn et 
al. (2007a) and other sections of this paper. We limit our 
comparison to masses above 8 x 1O 9 M because these are 
the halos directly resolved in our N-body simulation, be- 
fore small mass halos are included statistically. We cross- 
correlate the resulting 21 cm field with all halos above 
our fiducial choice of M gjln i n = 10 10 AfQ. The results of 
this comparison are shown in Figure 3, for outputs with 
(Xi) just slightly below (x^ = 0.5. 

The agreement between the hybrid and full radiative 
transfer calculations is quite good. In order to check 
how much of the small difference between the two cal- 
culations comes from differences in the 21 cm field, and 
how much from differences in the halo fields, we cross- 
correlate the hybrid 21 cm field with the simulated halo 
field (blue dashed lines) . Differences in the simulated and 
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FlG. 3. — Comparison between the hybrid and simulated 21 cm- 
galaxy cross spectrum. Top panel: The absolute value of the 21 cm- 
galaxy cross power spectrum. Bottom panel: The cross-correlation 
coefficient between the two fields. In each panel, the red dotted 
line shows the results from the full radiative transfer simulation. 
The black solid line shows results from cross-correlating the hybrid 
21 cm field with the hybrid halo field. The blue dashed line shows 
the cross-correlation between the hybrid 21 cm field, and the halo 
field from the N-body simulation. 

hybrid halo fields seem to be important on small scales, 
while differences between the 21 cm fields in the two cal- 
culations lead to most of the difference on large scales. 
Regardless, the hybrid calculations agree well with the 
full radiative transfer ones, and provide a useful means 
to estimate the 21 cm-galaxy cross spectrum rapidly. 

3. REDSHIFT EVOLUTION OF 21 CM-GALAXY CROSS 
POWER SPECTRUM 

Now that we have introduced our simulation tools and 
understand the basic 21 cm-galaxy cross power spectrum 
signal, let us examine its dependence on redshift and ion- 
ization fraction. How does the signal evolve as the filling 
factor of H II regions, and their characteristic size, in- 
crease? To address this, we calculate the 21 cm-galaxy 
cross power spectrum from our radiative transfer simu- 
lations, considering a wide range of redshifts in order to 
span most of the reionization epoch. We start by adopt- 
ing our fiducial model with M gim i n = 10 10 M Q at each 
redshift for simplicity. 

The results of this calculation are shown in Figure 4. 
At early times, near the very beginning of the reioniza- 
tion process ((xi) = 0.02), the galaxy and 21 cm fields are 
positively correlated on large scales. At this stage, galax- 
ies are extremely rare objects and are only just start- 
ing to ionize their surroundings. The galaxies turn on 
first in large scale overdense regions, which contain more 
matter and initially more neutral hydrogen than large 
scale under dense regions. These regions hence glow more 
brightly in 21 cm emission and lead to a positive 21 cm- 
galaxy cross-correlation on large scales, as shown by the 
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Fig. 4. — Redshift evolution of the 21 cm-galaxy cross 
power spectrum in our fiducial model. Top panel: The 
absolute value of the 21 cm-galaxy cross power spec- 
trum at different ionization fractions/redshifts. The 
redshifts and ionization fractions shown are ((xi),z) = 
(0.02, 11.46); (0.15, 8.76); (0.21, 8.34); (0.54, 7.32); (0.82, 6.90); 
and (0.96,6.77). Bottom panel: The cross-correlation coefficient 
between the 21 cm and galaxy fields as a function of wavenumber. 



black solid line in Figure 4. 

The galaxies quickly ionize their overdense surround- 
ings which consequently dim in 21 cm emission. On the 
other hand, the large scale underdense regions are still 
mostly free of galaxies and roughly maintain their ini- 
tial 21 cm brightness temperature. This leads to a brief 
period where the 21 cm-galaxy cross correlation has low 
amplitude on large scales, as large scale overdense re- 
gions dim in 21 cm emission and roughly equilibrate in 
brightness temperature with large scale underdense re- 
gions (Furlanetto et al. 2004, Wyithe & Morales 2007; 
Lidz et al. 2007b discuss a similar low-amplitude epoch 
for the 21 cm power spectrum). In our fiducial model, 
this 'equilibration phase' occurs when (xi) ~ 0.15, as 
shown by the red dotted line in the figure. This equi- 
libration epoch is relatively brief; the two fields quickly 
become anti-correlated on large scales. 

A caveat to this discussion is that our calculations as- 
sume that the spin temperature of the 21 cm transition is 
globally much larger than the CMB temperature. This 
approximation will be inaccurate early in the reioniza- 
tion process (Pritchard & Furlanetto 2007, Pritchard & 
Loeb 2008), and spin temperature fluctuations may com- 
plicate the cross-correlation signal close to the equilibra- 
tion phase. This is beyond the scope of the present paper, 
but may modify our results at very early times, perhaps 
when (x t ) < 0.1 (Pritchard & Furlanetto 2007). 

More robust, and detectable in the near future, are 
our results during the bulk of the reionization process, 
at which point the 21 cm and galaxy fields are anti- 
correlated. Once the anti-correlation is established, its 



scale dependence varies with redshift and ionization frac- 
tion. This behavior is shown in the green short-dashed, 
blue long-dashed, cyan dot-dashed and magenta dash- 
dotted lines which span model ionization fractions of 
(x^ = 0.21 - 0.96. As discussed in §2, this anti- 
correlation reflects the fact that galaxies turn on first in 
overdense environments and ionize their surroundings. 
As the ionized fraction increases, and the H II regions 
grow, the cross-correlation turns over on progressively 
larger scales. This illustrates that the 21 cm-galaxy cross 
power spectrum provides a relatively direct probe of bub- 
ble growth during reionization. 

We pause here to mention one slight caveat regard- 
ing our modeling of the small scale cross spectrum. The 
galactic sources will themselves contain neutral hydro- 
gen, a feature which is not properly included in our cal- 
culations. (This leads to a 21 cm signal after reionization; 
see Wyithe & Loeb 2008, Chang et al. 2008.) This ne- 
glected contribution should cause the cross spectrum to 
become positive on small scales. Since the signal from 
the diffuse IGM is much stronger on relevant scales than 
this galactic contribution (see Lidz et al. 2007b, their 
§2.3, for an estimate), we do not expect this to confuse 
the determination of the bubble-size induced turnover. 

As remarked previously, the precise turnover scale de- 
pends on the minimum host halo mass of the galaxies 
observed by our hypothetical survey. Roughly speak- 
ing, the turnover is set by the size of ionized regions 
around detectable galaxy hosts, and is insensitive to the 
size of ionized regions around fainter galaxies (see §6.1). 
In practice, the minimum detectable host mass - which 
impacts the turnover scale - may vary with redshift in 
a complicated way, depending on the flux-limit of the 
survey and the correlation between luminosity and halo 
mass. This will make the evolution of the 21 cm-galaxy 
cross spectrum more complicated than the illustrative re- 
sults of Figure 4, which are at fixed minimum host mass. 
In a flux-limited survey, the turnover scale will generally 
evolve less strongly with time: in this case, one detects 
only massive galaxies at early times, which tend to reside 
in larger bubbles than average. In order to disentangle 
the impact of varying minimum host mass and that of 
varying bubble size and ionization fraction, one could 
cross-correlate the 21 cm signal with galaxies of vary- 
ing luminosity and use the galaxy auto spectrum and 
luminosity function to help understand the correlation 
between galaxy luminosity and host halo mass. As we 
show in §6.1, measuring the turnover scale as a function 
of galaxy luminosity, allows one to determine the charac- 
teristic size of ionized bubbles as a function of luminosity. 

The 21 cm auto spectrum itself evolves as the filling 
factor of H II regions increases. Lidz et al. (2007b) 
explored how one might use the redshift evolution of 
the auto spectrum to constrain the evolution of the H 
II region filling factor. The redshift evolution of the 
cross spectrum, as considered here, would provide a com- 
plementary and essentially independent means for con- 
straining H II region growth. Ultimately, combining the 
two measurements should provide a cross check on each 
measurement and increase constraining power. More im- 
portant, the cross spectrum provides a much more di- 
rect indicator of characteristic bubble size than the auto 
spectrum. By measuring the cross spectrum in different 
galaxy luminosity bins, one can additionally determine 



how the bubble size depends on galaxy luminosity, in- 
formation which is not obtainable from the 21 cm auto 
spectrum alone. Note that the ionized regions form un- 
der the collective influence of many individual galaxies, 
but one still expects a statistical trend of bubble size 
with galaxy luminosity: more luminous galaxies tend to 
live in more massive halos, which inhabit larger over- 
densities, and are typically surrounded by larger ionized 
regions. Measuring the turnover in the cross spectrum 
for different galaxy luminosity bins offers a unique means 
of quantifying this trend. We will discuss the statistical 
power of several future surveys to detect the cross spec- 
trum evolution in §7. 

4. DEPENDENCE ON IONIZING SOURCE PROPERTIES 

Now that we understand the basic features of the 21 
cm-galaxy cross spectrum, we consider variations around 
our fiducial model parameters. First, let us examine 
how the signal depends on the properties of the ionizing 
sources. Precisely which sources of light produce most 
of the photons that reionize the IGM is highly uncer- 
tain. This depends on many poorly constrained quanti- 
ties such as the efficiency of star formation as a function 
of galaxy mass, the high redshift stellar initial mass func- 
tion (IMF), the fraction of ionizing photons that escape 
host galaxies to ionize the IGM and its dependence on 
host mass, the degree to which photoionization and su- 
pernova feedback suppress star formation in low mass 
halos, and other factors (e.g. Furlanetto et al. 2006a). 
A promising route to constrain some of these uncertain 
parameters is to study the differing impact these sources 
have on the surrounding IGM. Put simply, the IGM may 
provide a valuable laboratory for studying the first lumi- 
nous sources. In this section we show that measurements 
of the 21 cm-galaxy cross spectrum may help constrain 
ionizing source properties. 

To explore this, let us start with a simple model and 
vary two of our model parameters. First, we vary -Mx imul , 
the minimum mass of halos that host sources contribut- 
ing to reionization. Next, we vary M gjm i n , the mini- 
mum host mass of galaxies detectable by our hypothet- 
ical galaxy survey. We explore the impact of varying 
these parameters using 70 Mpc//i hybrid simulations, 
each normalized to (xj) = 0.5 at z = 6.9. To begin 
with, we fix the parameter M giin i n at 10 10 M Q /h and 
consider M x>min = 10 8 M Q /ft, 10 9 M Q //i, and 10 10 M @ /h 
respectively. 5 These are clearly simplified models, but 
they suffice to illustrate the basic sensitivity of the sig- 
nal to ionizing source properties. These models should 
approximate scenarios in which photo-heating (Thoul & 
Weinberg 1996, Navarro & Steinmetz 1997, Dijkstra et 
al. 2004) or supernova feedback (e.g. Springel & Hern- 
quist 2003) limit the efficiency of star-formation in small 
mass halos and diminish the contribution of these halos 
to reionization. 

Varying M Xjm i n across the range shown in Figure 5 
only weakly influences the 21 cm-galaxy cross spectrum. 
On large scales, the amplitude increases slightly with in- 
creasing -Mx^min, since the bias of the ionized regions is 

5 Note that we generally quote masses in units of Mq, but here 
and in §6.1 (owing to imperfect planning) we use Mq/1i units, and 
so the choice of A^/g m j n = lO lo M0/fe is slightly different than our 
fiducial choice of M gim ; n = 10 10 Mq. 
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Fig. 5. — The 21 cm-galaxy cross power spectrum for models of 
varying M x m j n . In each model curve the efficiency of the ionizing 
sources is adjusted to yield (xi) ~ 0.5 at z = 6.9, and the mini- 
mum detectable galaxy host has a mass of M gim ; n = 10 10 Mq /h. 
Top panel: The absolute value of the 21 cm-galaxy cross power 
spectrum. Bottom panel: The correlation coefficient between the 
galaxy and 21 cm fields in each case. The dependence of the cross 
power spectrum on the host halo mass of the ionizing sources is 
rather mild. 



larger for larger values of M Xjm ; n . Note, however, that 
the small-scale turnover occurs at very similar scales for 
each M x m i n . This occurs because, in each case shown 
here, the minimum detectable galaxy mass is larger than 
(or equal to) M X)m i n . The cross-correlation is mostly 
insensitive to the bubble sizes around smaller mass, un- 
detectable sources. As alluded to earlier, the turnover 
in the cross spectrum depends on the bubble sizes around 
galaxies above the minimum, mass detectable by our hy- 
pothetical galaxy survey, and is mostly insensitive to the 
bubble sizes around lower mass hosts. Note that the 
auto spectra of the ionization and 21 cm fields do de- 
pend on M Xim in (Furlanetto et al. 2006b, McQuinn et al. 
2007a, Lidz et al. 2007b) - models with larger M Xjln i n 
have larger bubbles (on average) at a given (xi). How- 
ever, it appears that the bubble sizes around high mass 
galaxies (with M > M gjln i n ) change only slightly with 
increasing M x . m i n , and hence the turnover scale in the 
cross spectrum is insensitive to M X:m i n . We have veri- 
fied this explicitly by calculating the average ionization 
as a function of distance around halos with M — M gim i n , 
for each of M x , min = 10 8 M Q /h and M Ximin = \Q w M Q /h. 
The ionization profiles around the massive halos are very 
similar in these models, supporting our interpretation. 

Another possibility is that ionizing photons escape effi- 
ciently only from high mass galaxies, and that low mass 
sources do not contribute to reionizing the IGM. This 
possibility is, in fact, suggested by the recent escape frac- 
tion simulations of Gnedin et al. (2008) . Even if low mass 
galaxies have a negligible escape fraction, they may still 
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Fig. 6. — 21 cm-galaxy cross power spectra for models where 
sources in high mass halos produce most of the ionizing photons. 
In each model, the minimum host halo mass of sources that allow 
ionizing photons to escape into the IGM is M XiInin = 1O 1o M //i. 
Top panel: The absolute value of the 21 cm-galaxy cross spectrum 
for galaxy surveys with minimum detectable host halo masses of 
A^g.min = 10 8 , 10 9 and 10 10 M Q //i. Bottom panel: The correlation 
coefficient between the galaxy and 21 cm fields in each case. The 
cross spectrum and correlation coefficient turn positive on small 
scales for cases in which the galaxy survey detects sources with 
mass below the minimum mass ionizing source. 

form stars efficiently and be detectable at wavelengths 
longward of the hydrogen ionization edge. This scenario 
produces an interesting signature in the 21 cm-galaxy 
cross power spectrum, provided one has a galaxy sur- 
vey capable of detecting the, presumably faint, sources 
in these low mass halos. In order to explore this, we fix 
the minimum host halo mass of sources contributing to 
the reionization of the IGM at M^ min = lO w M Q /h and 
calculate the 21 cm-galaxy cross spectrum with a galaxy 
survey probing sources in host halo masses larger than 
each of M g , min = 10 8 , 10 9 , and 10 10 'M Q /h. 

The results of this calculation are shown in Figure 6. 
On large scales, the amplitude of the cross spectrum in- 
creases as one raises the minimum detectable host halo 
mass. This increase simply owes to the usual increase in 
galaxy bias with increasing minimum host halo mass. 
Perhaps more interesting, however, are the results on 
small scales when the minimum detectable host halo 
mass is lower than the minimum ionizing source mass. 
In this case (see the model curves with M gim ; n = 10 8 and 
1O 9 M //i), the cross spectrum turns over on larger scales 
than in the model in which M g m i n = M Iim i n , it then re- 
verses sign, and goes positive on small scales. Detecting 
this behavior would indicate that ionizing photons escape 
only from massive host halos, and not from lower mass 
hosts. An ambitious survey is needed to detect the faint 
sources in low mass halos, and to detect the 21 cm-galaxy 
cross spectrum on small scales (see §7). Nonetheless, the 
proposed signature would provide an interesting indica- 
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Fig. 7. — Decomposition of the 21 cm-galaxy cross spectrum 
for a model in which the minimum detectable galaxy host mass 
is below the minimum host mass for ionizing sources. Here we 
show the decomposition for a model with M x m ; n = 10 10 MQ/h 
and Mg jIn ; n = 10 s Mq/H. Top panel: As in Figure 2, the abso- 
lute value of the 21 cm-galaxy cross spectrum, as well as the x- 
gal, p — gal and three-field terms. Additionally, we show the cross 
power spectrum between the neutral hydrogen field and the den- 
sity field itself. Unlike in our previous models, the three-field and 
p — gal terms do not perfectly cancel each other on small scales. 
This results because there are detectable galaxies outside of ion- 
ized regions in this model. Consequently, the 21 cm-galaxy cross 
spectrum changes sign around A; ~ l/i Mpc -1 and goes positive on 
small scales. Bottom panel: The cross-correlation coefficient be- 
tween the 21 cm and galaxy fields, as well as the cross-correlation 
between the neutral hydrogen and density fields. 



tion of a small escape fraction from low mass galaxies. 
Moreover, this signature is relatively direct - any indica- 
tion of a small escape fraction from low mass galaxies in 
the 21 cm auto spectrum will be more subtle, and likely 
degenerate with other effects. Note, however, that neu- 
tral hydrogen in the galaxies themselves may also result 
in a positive small-scale cross spectrum (§3), and it might 
be tricky to distinguish this from our escape fraction sce- 
nario. 

In order to understand this effect better, it is useful to 
calculate each of the terms in Equation (1) separately. In 
Figure 7 we examine each of these terms for our model 
with M Xi , nin = lO w M Q /h and M g , min = l0 8 M Q /h. In 
this case, the three-field and p — gal terms do not cancel 
each other, unlike in our fiducial case (Figure 2). This 
occurs because in this model low mass galaxies do not 
leak ionizing photons into the IGM, and can hence reside 
outside of the ionized regions which - in this model - are 
formed only by sources residing in higher mass halos. 
In this way, some low mass halos escape the 'masking' 
effect of the ionized regions (see §2.2 and Equation 3), 
and - since these low mass galaxies are correlated with 
the underlying density field - produce a positive small 
scale 21 cm-galaxy cross power spectrum. 
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5. THE IMPACT OF LYMAN-LIMIT SYSTEMS 

Next we consider the impact of Lyman-limit systems 
on our 21 cm-galaxy cross spectrum calculations. Once 
rcionization is complete, most ionizing photons are ab- 
sorbed in dense blobs of neutral gas known as Lyman- 
limit systems. Lyman-limit systems can also limit the 
mean free path of ionizing photons and halt the growth 
of H II regions (Furlanetto & Oh 2005, McQuinn et al. 
2007a) during reionization itself, particularly towards the 
end of rcionization. The precise physical nature and 
abundance of these systems at high redshift is highly 
uncertain, as is their role as photon sinks during reion- 
ization. Lyman-limit systems may be especially numer- 
ous and have a strong effect if 'mini-halos' - halos with 
mass less than the atomic cooling mass - manage to 
survive pre-heating prior to rcionization (Oh & Haiman 
2003) and are abundant during reionization (Haiman et 
al. 2001, Barkana & Loeb 2002, Shapiro et al. 2004). 

In order to quantify the impact of Lyman-limit sys- 
tems on the 21 cm- galaxy cross spectrum, we use the 
hybrid simulation scheme of Zahn et al. (2007), general- 
ized to include the recombination excursion-set barrier of 
Furlanetto & Oh (2005). In order to capture the small-fc 
power spectrum - where we expect the Lyman-limit sys- 
tems to have the most impact - we use the L^ ox = 130 
Mpc//i hybrid simulation. In this section, for simplicity, 
our hybrid simulation adopts the pure Press-Schcchtcr 
ionization barrier of Furlanetto et al. (2004), rather than 
the halo-smoothing algorithm (see Zahn et al. (2007) for 
comparisons). Since the present work is the first to in- 
corporate the recombination barrier into a hybrid sim- 
ulation scheme, we briefly review this model here, but 
refer the reader to Furlanetto & Oh (2005) for more de- 
tails. The recombination barrier reflects the requirement 
that for an H II region to grow, the instantaneous rate 
of photon production from the sources within the H II 
region must at least match the recombination rate of the 
ionized material inside the H II region. 

In Furlanetto & Oh (2005), the recombination rate 
is calculated using the model of Miralda-Escude et al. 
(2000). In this model at any given time, the interior of 
an H II region is ionized up to islands of small-scale over- 
density Aj, above which the gas is neutral. The mean 
free path to ionizing photons is then determined by the 
volume- filling factor of these overdensc islands. In par- 
ticular, the proper mean free path to ionizing photons is 
given by: 

A(^) = Ao(z)[l-F„(A 4 )r 2/3 , (4) 

where F t ,(A i ) denotes the volume-filling factor of regions 
with A < Aj, and Xo(z) is a normalization factor, which 
is given by Xo(z)H(z) — 60 km/s in Miralda-Escude et al 
(2000). Here we leave Ao(z) as a free parameter to gauge 
the dependence of our results on the observationally and 
theoretically uncertain mean free path. The filling factor 
F v (Ai) is computed using the gas density pdf in Miralda- 
Escude et al. (2000). Similarly, the recombination rate 
for the ionized gas, in a region of large scale over-density 
S ionized up to an overdensity Aj, is given by: 

A(S,Ai) = a A n e (l + S) / dAA 2 P(A). (5) 
Jo 

Here cka denotes the case-A recombination coefficient 
for the ionized gas, which we assume to be at 10 4 K, 



n e denotes the mean electron density in the IGM, and 
P(A) is the gas density pdf from Miralda-Escude et al. 
(2000). We assume helium is mostly singly- ionized, but 
not doubly-ionized, within the bubble interiors. The re- 
combination rate formula assumes that the density pdf, 
P(A), is independent of large-scale overdensity, S, which 
should be a good approximation for the large scales rel- 
evant here (Furlanetto & Oh 2005). 

With this formula for the recombination rate in hand, 
Furlanetto & Oh (2005) write down an excursion set bar- 
rier for a region of size R and overdensity S to overcome 
recombinations and be ionized by interior sources. In our 
notation, this formula is: 

C^f^>A(S,R), (6) 

where £ denotes the ionizing efficiency of the sources, 
/coil denotes the collapse fraction in halos above the min- 
imum host halo mass, and R is equated with the mean 
free path, A, which sets A^ through Equation (4), and 
A(5,R) through Equation (5). We implement this bar- 
rier, and apply it in a Monte Carlo fashion (Zahn et al. 
2007, Mesinger & Furlanetto 2007a), in conjunction with 
the normal Furlanetto et al. (2004) barrier. This bar- 
rier effectively prohibits ionizing photons from propagat- 
ing long distances, as regulated by the parameter Xo(z), 
and decreases the level of ionization fluctuations on large 
scales. 

In order to see how this can impact the 21 cm-galaxy 
cross power spectrum, we calculate the signal with hybrid 
simulations of varying Xo(z). In particular, we consider 
(xi) = 0.8 and vary An(z) over the range X (z)H(z) — 
5 — 60 km/s. We span here a rather broad range of mod- 
els, which is appropriate given our limited observational 
constraints on the mean free path to ionizing photons 
at high redshift. The results of calculations with no 
Lyman-limit systems, and each of Xo(z)H(z) = 5 and 10 
km/s, are shown in Figure 8. Since Xq(z)H{z) — 5 km/s 
is l/12th of the fiducial Miralda-Escude et al. (2000) 
value, our values represent rather extreme choices for 
the mean free path. A more meaningful characterization 
than Ao(^), is the characteristic scale where the recom- 
bination barrier (Equation 6) crosses the usual Furlan- 
etto et al. (2004) barrier (see Furlanetto & Oh 2005). 
We denote the scale where the two barriers cross by 
-Rrcc,barr- For models with Xo(z)H(z) = 5, 10 km/s (and 
{x^ = 0.8, z = 6.90), the barriers cross at respective radii 
of i?rec,barr = 4 and 7 Mpc/h. Note that, for our choice of 
model parameters, the recombination barriers are not so 
steep, and so some fraction of points do manage to cross 
the barriers on smoothing scales roughly twice as large 
as i?rcc,barr- Comparing the red and black model curves, 
we see that the hybrid scheme accurately captures the 
21 cm-galaxy cross power spectrum signal from our full 
radiative transfer simulations in the no Lyman-limit sys- 
tem case as in Figure 3. 

The model curves with Xq(z)H(z) = 10 km/s 
(-Rrec.barr = 7 Mpc//i) and Xq{z)H{z) = 5 km/s 
(-Rrec.barr = 4 Mpc/h) illustrate that decreasing the mean 
free path forces the 21 cm-galaxy cross power spectrum 
to turn over towards large scales, rather than simply flat- 
tening out on large scales as in our fiducial model. 6 The 

6 In our fiducial model, the 21 cm-galaxy cross power spectrum 
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Fig. 8. — Dependence of the 21 cm-galaxy cross power spec- 
trum on the abundance of Lyman-limit systems. Top panel: The 
absolute value of the cross spectrum. Bottom panel: The cross- 
correlation coefficient. The black solid line shows the cross spec- 
trum in our fiducial model at (zj) = 0.8 (z = 6.90). The red 
dotted line shows an equivalent model from our hybrid simulation 
scheme. The green short-dashed and blue long-dashed lines show 
cross spectrum calculations for models with abundant Lyman-limit 
systems (see text). The recombination barrier scales (see text) for 
the models with \q(z)H(z) = 5 km/s, and \q(z)H(z) = 10 km/s 
are -R rec ,barr = 4, 7 Mpc/h respectively. The Lyman-limit systems 
force the cross spectrum to turn-over towards large scales, but the 
effect is relatively mild. 

cross spectrum develops a more well denned character- 
istic scale as the mean free path decreases. This trend 
results because decreasing the mean free path limits the 
formation of very large H II regions which in turn re- 
duces the amount of large scale cross power. Although 
the figure illustrates a clear trend of decreasing large scale 
power with decreasing mean free path, we caution that 
there is no simple one-to-one correspondence between 
mean free path and H II region size. As a specific il- 
lustration of the distinction between the mean free path 
and H II region size, consider the post-reionization IGM. 
In the post reionization IGM, essentially the entire vol- 
ume of the IGM is ionized, and the bubble size hence 
infinite, while the mean free path is still finite. 

Note that although the figure shows a clear trend of 
decreasing large scale power with decreasing mean free 
path, the dependence on the abundance of Lyman-limit 
systems is rather weak. This may preclude strong con- 
straints on the abundance of Lyman-limit systems from 
future measurements of the 21 cm-galaxy cross power 
spectrum. On the other hand, it implies that the mean 
free path to ionizing photons is not a very important fac- 
tor in our modeling of the cross spectrum. This should 

should turn over on some scale larger than that of our simulation 
box. Note however that foreground contamination may make such 
scales inaccessible to future 21 cm observations (McQuinn et al. 
2006). 



allow us to robustly constrain other parameters from fu- 
ture measurements, in spite of our ignorance of the high 
rcdshift mean free path. 

6. DEPENDENCE ON GALAXY-SELECTION TECHNIQUE 

In this section, we consider the dependence of the 21 
cm-galaxy cross power spectrum on the manner in which 
the galaxies are selected. Thus far we have calculated the 
21 cm-galaxy cross power spectrum by cross-correlating 
our 21 cm field with all simulated halos above some min- 
imum detectable halo mass cut. In other words, we as- 
sume that each simulated dark matter halo contains one 
luminous galaxy, and that the flux limit of our hypothet- 
ical galaxy survey corresponds precisely to a minimum 
host halo mass. This is clearly a vast simplification, and 
so it is important to explore the signal's sensitivity to 
the minimum mass cut. Note, however, that since we 
consider only scales much larger than the halo virial ra- 
dius, we are not sensitive to the distribution of galaxies 
within each host halo (e.g. Scoccimarro et al. 2001). 
Another important effect is that galaxies selected on the 
basis of Ly-a emission will have a different 21 cm-galaxy 
cross-correlation than galaxies selected by, for example, 
the Lyman-break technique. We presently explore the 
sensitivity of our results to the type of galaxy selected by 
our hypothetical survey. 

6.1. Minimum Detectable Mass 

Let us first fix the population of galaxies responsi- 
ble for reionizing the IGM, and the resulting ionization 
field, while varying the minimum host halo mass contain- 
ing galaxies detectable by our hypothetical survey. We 
explored this issue somewhat already in §4, but there 
we focused on scenarios in which ionizing photons do 
not escape from low mass galaxies - i.e., cases where 
-^g,min < -Mx,min- Here we focus on models in which ion- 
izing photons manage to escape from low mass halos, yet 
such sources are too faint to be detectable by our hypo- 
thetical galaxy survey. In other words, we consider cases 
where M gtin x n > M Xirmn . This is likely the more relevant 
case for first generation surveys where it will be difficult 
to detect the presumably faint galaxies that reside in low 
mass halos. 

Another point is that it is unlikely that all halos above 
some M gjm j n host galaxies that actively produce de- 
tectable photons at any given instant of time. In other 
words, the 'duty cycle' - the fraction of halos above a 
given mass which contain galaxies actively radiating at a 
particular time - is likely less than unity. As quantified 
below, varying the minimum detectable host mass im- 
pacts the mean 21 cm- galaxy cross power spectrum, but 
reducing the duty cycle of detectable galaxies does not 
by itself change the average cross power spectrum signal. 
This is because galaxy bias is independent of duty cycle, 
provided that the duty cycle is itself independent of mass 
for halos above the minimum detectable host mass. De- 
creasing the duty cycle instead increases the level of Pois- 
son fluctuations in the galaxy abundance, which increases 
the cross spectrum variance - and makes the cross spec- 
trum more difficult to detect (Furlanetto & Lidz 2007, 
§7) - while preserving the average cross power spectrum. 
Presently, we focus on how the minimum host halo mass 
impacts the mean signal, and defer a discussion of the 
signal variance to §7. Of course our assumption that the 
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Fig. 9. — Dependence of 21 cm-galaxy cross power spectrum 
on the minimum detectable galaxy mass. Top panel: The cross 
power spectrum for a survey that detects all galaxies in halos of 
mass larger than M gmin = 10 9 M e /h, M giInin = 10 10 M e /h, 5 X 

10 10 Mq /h and 10 11 Mq /h respectively. For each model curve, we 
fix the minimum host mass of the ionizing sources at M x m ; n = 
10 8 Mq//i. Bottom panel: The correlation coefficient between the 
galaxy and 21 cm fields in each case. 

duty cycle is independent of host mass may be too sim- 
plistic and modifying this assumption may impact our 
results in detail. Our simple model should, however, suf- 
fice to illustrate the basic sensitivity to host halo mass. 
Moreover, in practice one can constrain the run of duty 
cycle with halo mass from the observed galaxy luminosity 
function and galaxy-galaxy auto power spectrum, which 
can then inform models for the 21 cm-galaxy cross spec- 
trum. 

The results of varying the minimum detectable host 
halo mass are shown in Figure 9. Here we use the 
70 Mpc/h hybrid simulation, fix M Xjmin = 10 s M Q /h 
(just a little above the atomic cooling mass), and vary 
M gimin from 10 9 M Q /h to 10 10 M Q /h, 5 x 10 w M Q /h, and 

10 11 M Q /h. On large scales one sees the usual increase 
in the amplitude of the cross spectrum as M &>m i n in- 
creases. On small scales, the cross spectrum turns over 
on progressively smaller scales as M gtmin decreases, and 
the cross-correlation starts to sample the small bubbles 
around the lower mass halos. This is a continuation of the 
behavior seen in Figure 5. It illustrates that the turnover 
scale needs to be interpreted with caution, since it is 
sensitive to M g!m j n . The dependence of turnover scale 
on luminosity is very interesting, however; examining it 
amounts to a measurement of the characteristic bubble 
size around galaxies of varying luminosity. In order to 
best constrain this dependence one needs a galaxy sur- 
vey with a sufficiently large dynamic range in luminosity, 
and one needs to examine the luminosity dependence of 
the galaxy luminosity function, auto spectrum and cross 
spectrum. This also highlights the scientific benefit of 



measuring the 21 cm-galaxy cross spectrum, as it is im- 
possible to determine the luminosity dependence of the 
bubble size distribution from the 21 cm auto spectrum 
alone. 

6.2. Lyman-alpha Selected Galaxies 

A successful approach for finding high redshift galaxies 
is to search for Ly-a emission, which is frequently strong 
in young galaxies (Partridge & Peebles 1967). There are 
numerous existing and planned Ly-a emitter (LAE) sur- 
veys (e.g. Rhoads et al. 2004, Kashikawa et al. 2006, 
Stark et al. 2007), with the Subaru telescope currently 
providing the largest high redshift sample, consisting 
of ~ 58 photometric LAEs at z = 6.5 discovered in a 
~ 30' x 30' field (Kashikawa et al. 2006). LAE surveys 
have an advantage over high redshift Lyman break sur- 
veys in that they target narrow wavelength intervals, in 
between strong night sky background lines, in search of 
strong emission lines. This allows one to detect galaxies 
that are unobservable by Lyman break selection owing 
to the strong night sky background at the relevant wave- 
lengths; sizable Lyman-break galaxy catalogues likely 
await a widefield, near-infrared instrument in space or 
30-meter class telescopes on the ground. Existing LAE 
surveys and their extensions hence likely provide the first 
opportunity to detect the 21 cm-galaxy cross power spec- 
trum, particularly if the IGM is partly neutral at z ~ 6.6 
(Wyithe & Loeb 2007, Furlanetto & Lidz 2007, §7). 

To this end, we would like to consider the cross cor- 
relation between 21 cm and Ly-a selected galaxies. In 
contrast to galaxies selected via, for example the Ly- 
man break or H-a, the abundance of observable Ly-a 
selected galaxies will be modulated by the presence of 
neutral hydrogen, impacting their clustering (Furlanetto 
et al. 2006c, McQuinn et al. 2007a, 2007b, Mesinger 
&: Furlanetto 2007b, 2007c) and the 21 cm-galaxy cross 
power spectrum. This modulation occurs because damp- 
ing wing absorption extinguishes the Ly-a line for sources 
sufficiently close to the edge of an H II region (Miralda- 
Escude 1998), where there is an adjacent column of neu- 
tral hydrogen. This means Ly-a selected galaxies will 
lie towards the center of large-ish, R > 1 proper Mpc, 
H II regions (Furlanetto et al. 2006c, McQuinn et al. 
2007a, 2007b, Mesinger & Furlanetto 2007b, 2007c). Ow- 
ing to this, and because observable galaxies will have 
larger masses after Ly-a selection, the clustering of Ly-a 
selected galaxies should increase as such galaxies are de- 
tected at earlier and earlier stages of reionization. Thus 
far our mock galaxies have been uniformly selected - 
i.e., not modulated by the presence of neutral hydro- 
gen. While this is appropriate for Lyman break selected 
galaxies, it is incorrect for Ly-a selected galaxies before 
reionization completes. 

In order to examine the impact of Ly-a selection on the 
21 cm-galaxy cross spectrum, we compute the damping 
wing optical depth, rrj, towards each of our target halos. 
For simplicity, we calculate only the damping wing opti- 
cal depth at line center (see e.g. Equations (1) and (2) 
of Mesinger & Furlanetto 2007c) and do not model res- 
onant absorption (see McQuinn et al. 2007b, Mesinger 
&: Furlanetto 2007b, Dijkstra et al. 2007 for discussion). 
Assuming that each source's luminosity is proportional 
to its host halo mass, and adopting our fiducial choice of 
-Wg.min = 10 10 Af Q , our Ly-a survey detects sources with 
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Fig. 10. — The 21 cm-galaxy cross power spectrum for 
Ly-a selected galaxies. Top panel: The 21 cm-galaxy cross 
power spectrum for each of Ly-a selected galaxies (dashed lines) 
and 'all galaxies' (solid lines). We show results at ((xi),z) = 
(0.21,8.34), (0.54, 7.32), (0.82,6.90), and M g , min = 10 10 M©. Bot- 
tom panel: The cross-correlation coefficient between the 21 cm and 
galaxy fields for the model curves in the top panel. 



Mexp[-r D ] > M g , min . 

With mock Ly-a selected galaxy catalogues in hand, 
we calculate the cross power spectrum of these galaxies 
with the 21 cm field for a few outputs of differing ioniza- 
tion fractions and redshifts. The results of this calcula- 
tion are shown in Figure 10. Comparing the cross spec- 
tra of the Ly-a selected galaxies (top panel, dashed lines) 
and uniformly selected galaxies (solid lines), we see that 
the large-scale amplitude of the cross spectra are higher, 
and that the signal turns over on larger scales, for the 
Ly-a selected galaxy samples. It is easy to understand 
these trends qualitatively. For a galaxy to be visible in 
Ly-a it must reside in a sufficiently large H II region - 
or more accurately, it needs to reside along a sufficiently 
long ionized skewer - to avoid complete attenuation ow- 
ing to damping wing absorption. The largest H II regions 
form around the most clustered sources, and so the galax- 
ies detectable in Ly-a are more clustered than uniformly 
selected galaxies of the same host halo mass (Furlanetto 
et al. 2006c, McQuinn et al. 2007a, 2007b, Mesinger 
& Furlanetto 2007b, 2007c). This enhanced clustering is 
reflected in the boosted large scale 21 cm-galaxy cross 
power spectrum. Likewise, the turnover on small scales 
is set by the characteristic H II region size around de- 
tectable galaxies, which increases for the Ly-a selected 
galaxies: Ly-a galaxies residing in small bubbles are at- 
tenuated out of the sample by damping wing absorption. 

This is visualized most clearly in the cross-correlation 
coefficient between the 21 cm and galaxy fields (Figure 
10, bottom panel). In the uniformly-selected galaxy sam- 
ple, the correlation coefficient turns over on progressively 
larger scales as reionization proceeds. By contrast, the 



small scale turnover in the Ly-a selected sample is rela- 
tively fixed. At early times when the bubbles are small, 
the turnover in the cross spectrum with the Ly-a se- 
lected sample largely reflects the damping wing scale. In 
order to best characterize bubble growth in the early and 
middle stages of reionization, one requires a uniformly- 
selected galaxy sample, rather than a Ly-a selected sam- 
ple. Finally, note that the cross correlation coefficient in 
the Ly-a selected samples does not quite reach r = — 1 
on large scales. This behavior is enhanced early in reion- 
ization, and presumably results because small bubbles 
are missed by Ly-a selection, which contribute most sig- 
nificantly at low ionized fractions. In summary, while 
Ly-a selected samples will be interesting for initial cross 
spectrum detections, uniformly-selected samples will be 
required to best constrain bubble growth during reion- 
ization. 

7. DETECTABILITY 

In this section we calculate the statistical significance 
at which future surveys can detect the 21 cm-galaxy cross 
spectrum and briefly consider the resulting insights into 
reionization. Here we follow closely the calculations in 
Furlanetto & Lidz (2007), simply extending them to in- 
corporate our simulated cross spectrum signal. In our 
calculations we consider a 21 cm survey with the speci- 
fications planned for each of the MWA (Bowman et al. 
2006, McQuinn et al. 2006, Mao et al. 2008) and LO- 
FAR, which we review below (§7.2 and §7.3). We con- 
sider two basic types of galaxy surveys. First, we con- 
sider a survey similar to the Subaru deep field survey for 
Ly-a emitters (Kashikawa 2006). Since the Subaru sur- 
vey is ongoing, this calculation should illustrate what is 
achievable in the near future as the MWA and LOFAR 
come online. Note that the present Subaru deep field 
does not overlap with the planned MWA target fields 
(M. Morales, private communication, 2008), but our cal- 
culations still serve to illustrate what is possible in the 
near future. Next, we consider a more futuristic galaxy 
survey. Coupling our futuristic galaxy survey with the 
MWA or LOFAR, one can potentially measure the cross 
power spectrum at several redshifts, probing the evolu- 
tion in the cross spectrum signal, and tracing the growth 
of H II regions during reionization as in Figure 4. 

7.1. Statistical Error Estimates 

To begin with, we describe our statistical error esti- 
mates, reviewing the formulae for cross spectrum error 
bars for a survey of given specifications, incorporating 
sample variance, thermal noise in the 21 cm radio tele- 
scope, and shot-noise and redshift errors in the galaxy 
distribution. Here we restrict ourselves to the spheri- 
cally averaged cross spectrum, since the MWA and LO- 
FAR have limited transverse sensitivity and since very 
precise galaxy redshifts will be required to measure the 
angular dependence of the cross spectrum (Furlanetto & 
Lidz 2007). 

We generally find it convenient to estimate error 
bars on the cross-correlation coefficient, r(k), rather 
than on the cross spectrum itself. We desire an 
estimate of the error bar on r(k) calculated from 
spherically averaged auto and cross spectra in a bin 
of logarithmic width e = dink. For notational 
convenience let us denote the cross-correlation co- 
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efficient by r(k) = J P2i,gai(fc)/[P 2 i(fc)P ga i(fc)] 1/2 - 
A(k)/[B{k)C{k)] 1 / 2 . Propagating errors, the fractional 
error on the cross-correlation coefficient is: 



AB 



(k) 



U AC 

AC 



(k) 



U BC 

2BC 



(k). 



(7) 



This expression involves the the cross spectrum variance, 
the 21 cm and galaxy power spectrum variances, and the 
co- variance between the various power spectra, each cal- 
culated for spherically averaged power spectra in shells 
of logarithmic width e. The one disadvantage of con- 
sidering the cross-correlation coefficient is that the cross 
spectrum can, under appropriate circumstances, be de- 
tected at higher sensitivity than the 21 cm auto spectrum 
(Furlanctto & Lidz 2007). In this case, the error bar on 
the cross correlation coefficient, which includes an error 
term from the auto spectrum, will be larger than that 
for the cross spectrum alone. Furthermore, estimating 
the cross-correlation coefficient requires an auto spec- 
trum estimate and is hence more susceptible to residual 
foreground contamination than the auto spectrum alone. 

Consider first the power spectrum variance terms for 
a single fe-mode, with line of sight component ku = fik, 
restricting ourselves to modes in the upper-half plane. 
The power spectrum variance expressions are as follows 
(Furlanetto & Lidz 2007): 

oi(fc.M) = var [-P21,gal(fc,M)] 

= o [ P 2 2 i :g ai( fc >M) +(T B (k,fj,)a c (k,iJ,)] , (8) 



= var [P 2 i(fc,/i)] 



T 2 



T 2 Bt int n{k ± ) 



D 2 AD /A 2 



0c( fc >M) = var[P ga i(fc,/i)] 



(9) 



(10) 



The second term in Equation (9) comes from thermal 
noise in the radio telescope, the second term in Equa- 
tion (10) expresses the shot noise error, while the other 
terms in the above equations are sample variance contri- 
butions. The thermal noise term depends on the system 
temperature, T sys ; the co-moving distance to the cen- 
ter of the survey at redshift z, D(z); the survey depth, 
AD; the observed wavelength, A; the effective area of 
each antenna tile, A e ; the survey bandwidth, B; the to- 
tal observing time, tint; and the distribution of anten- 
nas. The factor To in the denominator of the detector 
noise term arises because we normalize the 21 cm field 
by To so that it is dimcnsionless - i.e., we work with the 
field St/T . The dependence on antenna configuration is 
encoded in n(k±) which denotes the number density of 
baselines observing a mode with transverse wavenumber 
k± (McQuinn et al. 2006, Bowman et al. 2006, Lidz 
et al. 2007b). The galaxy shot-noise term depends on 



n ga i which is the abundance of galaxies observable in our 
hypothetical survey, and on the accuracy of the galaxy 
rcdshifts obtained by the survey. The galaxy redshift 
error is given in co-moving units by <r x — ca z /H(z). 

We also require expressions for the co- variance between 
the different power spectra. These can be computed 
straightforwardly: 

^ab(^.m) = cov [P2i,g^(k,fi),P2i(k,fi)] 

= P 2 l,gal(fc,M)^2l(fc,M), (H) 
2 Ac{ k iV) = cov [-P21,gal(fc,j"),-Pgal(fc,M)] 

= ^21,gal(fc,M)^gal(fc), (12) 

^bc ( k > /") = cov [ p 2i (k, n) , Pgai {k,n)\ 

= A 2 l,gal(fc,M)- ( 13 ) 

Finally, we can estimate the error bar on the cross- 
correlation coefficient formed from our spherically aver- 
aged power spectra. We do this by adding the power 
spectrum error bars for individual fc-modes from Equa- 
tions (8)-(13) in inverse quadrature, performing a simi- 
lar calculation for each individual term in Equation (7). 
For example, the variance of the cross-spectrum averaged 
over a spherical shell of logarithmic width e = dink is: 

1 e ^ ^survey A A* 



a\(k) 



4tt 2 



a\{k,n)' 



(14) 



The effective survey volume for our radio telescope is 
Purvey = D 2 AD (A 2 /A e ) . If the galaxy survey has a 
lesser volume, V ga ,\, then the variance of the binned power 
spectrum estimated from this lesser volume (for a mode 
contained within the lesser survey volume) is larger by a 

factor Of ~ Vgal/Kmrvcy. 

7.2. The MWA 

With these expressions in hand let us briefly describe 
the specifications we assume for our 21 cm and galaxy 
surveys. The MWA will have a large field of view, span- 
ning ~ 800 deg 2 on the sky, and consisting of 500 an- 
tenna tiles each with an effective area of A e = 14m 2 at 
z — 8 (Bowman et al. 2006). Each antenna tile is 4m 
wide, and we follow Bowman et al. (2006), McQuinn et 
al. (2006) in assuming that the antennas are packed as 
closely as possible within a compact core, with the distri- 
bution subsequently falling off as r~ 2 in order to capture 
large baselines, out to a maximum baseline of 1.5 km. 
Lidz et al. (2007b) argued that a compact antenna con- 
figuration, with all of the MWA's antennas packed as 
close as possible, is a superior configuration for 21 cm 
auto spectrum measurements. This configuration is less 
good for the cross spectrum: given a galaxy survey with 
photometric redshifts, one needs to balance the MWA's 
high line-of-sight sensitivity, yet poor transverse sensitiv- 
ity, with the galaxy survey's high transverse sensitivity, 
yet poor line-of-sight sensitivity owing to redshift uncer- 
tainties. 

We assume that the system temperature is set by the 
sky temperature, which we take to be T S ys = 280(1 + 
z/7.5) 2 ' 3 K, following Wyithc & Morales (2007). We 
consider a bandwidth of B = 6 Mhz observing for a total 
time of ti n t = 1000 hrs. The bandwidth is chosen to be 
small enough to ensure that the signal evolves minimally 
over the corresponding redshift interval (McQuinn et al. 
2006). 
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7.3. LOFAR 

LOFAR and the MWA are expected to have compa- 
rable sensitivity for detecting the 21 cm auto spectrum 
(McQuinn et al. 2006, Mao et al. 2008). 7 LOFAR will 
observe a smaller field of view than the MWA (by a fac- 
tor of > 10), but its larger collecting area compensates 
for its reduced sky coverage. The larger field of view 
of the MWA is, however, wasted when cross-correlating 
with a galaxy survey that covers a much smaller patch 
on the sky. We anticipate then that LOFAR should at 
least initially provide a more sensitive detection of the 
21 cm-galaxy cross spectrum than the MWA (Furlanetto 
& Lidz 2007). 

The precise collecting area and antenna configuration 
for LOFAR are still evolving, but we follow the simple 
model of McQuinn et al. (2006) as a plausible estimate. 
LOFAR will consist of 32 large antenna stations within 1 
km, with minimum baselines of 100 m. Each LOFAR 
station simultaneously observes 4 separate regions on 
the sky. We assume that LOFAR's antenna stations are 
closely packed in a compact core, before tapering off in 
an r~ 2 configuration out to a maximum radius of 1 km. 
The effective area of each antenna is 656 m 2 at z = 8, and 
we linearly interpolate between the values in McQuinn et 
al. (2006) (their Table 1), to find the collecting area at 
other redshifts. As for the MWA, we consider 1, 000 hrs. 
of LOFAR observations over a bandwidth of B = 6 Mhz. 

7.4. Subaru-like Survey 

We first consider the dctectability of the 21 cm-galaxy 
cross spectrum obtainable by combining the MWA and 
LOFAR with the Subaru deep field survey, and plausible 
extensions. The existing Subaru deep field survey has a 
0.25 deg 2 field of view and locates Ly-a emitters near z = 
6.6 to a depth of 130 A. The existing spectroscopically- 
confirmed Subaru deep field sample at redshift z = 6.6 
consists of 36 emitters (Kashikawa et al. 2006). The 
number density of spectroscopically-confirmed emitters 
corresponds to n ga i = 1.6 x 10~ 4 Mpc -3 . An extension to 
the Subaru deep field, the Subaru/XMM-Newton Deep 
Survey is already underway, and promises to increase the 
observed z = 6.6 field of view by a factor of ~ 4, reaching 
a survey area of A survey ~ 1 deg 2 by the end of the year 
(Ouchi 2005). 

Given the rapid progress in area surveyed, we exam- 
ine how the detectability of the cross spectrum scales 
with increasing field-of-view, at fixed depth and galaxy 
number density. In practice, we calculate the cross spec- 
trum S/N for a galaxy survey that covers the full field- 
of-view of the MWA (~ 800 deg 2 at this redshift), and 
scale the signal-to-noise (squared) in each fc-bin down- 
wards by the ratio of the galaxy survey volume to that 
of the MWA (see Equation 14). We perform a similar 
calculation for LOFAR. For each model cross-spectrum, 
our S/N estimates assume a galaxy number density of 
n gai = 1.6 x 10 -4 Mpc -3 , and redshift errors for the 
spectroscopically confirmed galaxies of a z — 0.01. The 
assumed redshift error corresponds to a velocity of sev- 
eral hundred km/s, motivated by the typical velocity off- 

7 We recently learned that budget setbacks are forcing LOFAR 
to reduce its collecting area. We are unaware of the details of the 
reduction, but this will reduce LOFAR's sensitivity compared to 
our estimates here. 
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Fig. 11. — Signal to noise for cross spectrum detection. The sig- 
nal to noise at which Subaru-like surveys, coupled with the MWA 
(solid lines) and LOFAR (dashed lines), can detect the 21 cm- 
galaxy cross spectrum as a function of survey area at z = 6.6. The 
different curves indicate different models for the ionization frac- 
tion. Each curve extends from the current Subaru area (~ 0.25 
deg 2 ) to the full field of view of MWA (~ 800 deg 2 ) or LOFAR 
(~ 70 deg 2 ). The red dotted line indicates a 3 — <r detection of the 
cross spectrum. 

sets for Ly-a lines observed by Shapley et al. (2003) in 
Lyman break galaxies at z ~ 3. The total S/N is de- 
termined by summing the signal squared divided by our 
variance estimate (Equations 8 and 14) over all detected 
fc-bins. 

In Figure 11 we show the detectability of the cross 
spectrum for a few different models over a range of sur- 
vey areas. In each model we adopt a plausible mini- 
mum detectable galaxy mass of M g , min = 1O 1O M , fix- 
ing the galactic duty cycle to match the observed Sub- 
aru deep field abundance of n ga i = 1.6 x 10 -4 Mpc -3 . 
The corresponding duty cycle in our models is around 
~ 1%. Given that we currently have few direct observa- 
tional constraints on the filling factor of H II regions near 
z = 6.6, we consider models in which the ionized frac- 
tion is (Xi) = 0.54, 0.82, and 0.96 at this redshift. Strictly 
speaking, we should use our Ly-a selected cross spectrum 
models here, but at these ionization fractions we expect 
this to boost our S/N only slightly (Figure 10). For 
simplicity, we conservatively ignore the clustering-boost 
from Ly-a selection here. The amplitude of the cross 
spectrum is largest amongst these models at (x/) = 0.54, 
and is substantially smaller by (x/) = 0.96 (see Figure 4), 
and so the more neutral models will be easier to detect. 

The results shown in Figure 11 illustrate that cross- 
correlating the MWA with a galaxy survey of size compa- 
rable to the present Subaru deep field survey will not al- 
low a significant cross spectrum detection (S/N < 1— a), 
even if the IGM is significantly neutral at z = 6.6. How- 
ever, extensions to the Subaru deep field that cover a 
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larger area on the sky should yield significant cross spec- 
trum detections. For example, extending the present sky 
coverage by a factor of ~ 10 — 15 to 3 deg 2 should provide 
a > 2 — 3fi cross spectrum detection in our (xi) — 0.54 
and (x^ = 0.82 models, but only a ~ 1 — a detection in 
our (x^ — 0.96 model. As anticipated, cross-correlating 
with LOFAR can improve the S/N by a factor of a few. 
Cross-correlating LOFAR with a galaxy survey of only 
1 — 2 deg 2 should allow a 3 — a cross spectrum detection 
in our (x<) = 0.54 and (xi) — 0.82 models. As men- 
tioned earlier, the Subaru survey should reach this sky 
coverage soon, making a cross spectrum detection fea- 
sible in the next few years if the IGM is partly neutral 
around z ~ 6.6. More ambitious surveys covering the 
entire MWA sky ~ 800 deg 2 would clearly move beyond 
mere detections - the detection S/N for such surveys is 
at the tens of sigma level (see Figure 11) - and provide 
valuable constraints on reionization models. 

7.5. Futuristic Survey 

Since more futuristic surveys will go beyond mere de- 
tections, we proceed to consider the constraining power 
of a large field-of-view galaxy survey - cross-correlated 
with the MWA - in more detail. Futuristic surveys will 
allow one to probe small scales, capture the turnover 
in the cross-correlation coefficient and hence constrain 
bubble growth during reionization. We calculate the ex- 
pected error bar on the cross correlation coefficient as 
a function of wavenumber for a galaxy survey spanning 
the full MWA field of view, and consider the ability of 
this survey to constrain reionization models. Here we as- 
sume that the galaxy survey can detect fainter galaxies, 
reaching a galactic abundance 100 times larger than in 
the previous section, with the same redshift accuracy of 
a z =0.01. We consider a redshift of z — 7.3. 

Using again the models of §3 as input, we estimate 
the statistical sensitivity of our futuristic galaxy survey. 
The results of our sensitivity calculation are shown in 
Figure 12, for spherical bins of logarithmic width e = 
0.5. Here we plot the simulated signal when the IGM is 
~ 50% ionized along with a statistical error estimate for 
our hypothetical survey. For contrast, we additionally 
show theoretical model curves when the IGM is each of 
~ 20% and ~ 80% ionized. 

The curves and errorbars in Figure 12 show that the 
statistical precision of our hypothetical survey is high 
enough to distinguish between the different stages of 
reionization shown in the figure over about a decade in 
scale. On large scales, the measurement is limited by 
foreground removal while on small scales 21 cm detec- 
tor noise and galaxy redshift errors limit the statistical 
precision of the measurements (Furlanetto & Lidz 2007) . 
Although the 21 cm-galaxy cross power spectrum sig- 
nal is much less susceptible than the 21 cm auto power 
spectrum to foreground contamination, free-free and syn- 
chrotron emission from the high redshift galaxies in our 
survey still contaminate the 21 cm-galaxy cross power 
spectrum somewhat (Furlanetto & Lidz 2007) . This pro- 
hibits measuring modes with lines of sight wave num- 
bers k\\ < 2ir/AD, where AD is the depth of the sur- 
vey. The discreteness of the survey means that the only 
modes in our survey that satisfy this requirement have 
fcii = 0, hence all modes with k < k^ will be removed in 
the foreground cleaning process. The black dashed line 
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Fig. 12. — Error estimate for the 21 cm-galaxy cross correlation 
coefficient. Here we consider a futuristic galaxy survey covering 
the entire MWA field of view, cross-correlated with the MWA. 
The blue points show the mean signal and error estimates for our 
hypothetical 21 cm/galaxy survey when (x{) = 0.54. The other 
curves show the cross correlation coefficient when (xi) = 0.21 and 
0.82 respectively. Our hypothetical survey should help constrain 
the volume- weighted ionization fraction. The vertical black dashed 
line shows the wavenumber corresponding to the survey depth, 
below which foreground contamination will prohibit extracting the 
signal. 



indicates the wavenumber corresponding to the survey 
depth in our hypothetical survey. We should remind the 
reader here of one trade-off involved with considering the 
cross-correlation coefficient rather than the cross spec- 
trum alone. The cross-correlation coefficient is a more 
convenient quantity than the cross spectrum for visual- 
izing the small-scale turn-over (see Figure 12), but it is 
less desirable in that it includes the auto spectrum, which 
is more susceptible to foreground contamination. 

The sensitivity estimates shown in Figure 12 are en- 
couraging, and suggest that future 21 cm-galaxy surveys 
may help constrain the filling factor and size distribution 
of H II regions during reionization. Comparing our error 
estimates with the results of Figure 9 suggests that futur- 
istic surveys might also - by measuring the cross spec- 
trum in different galaxy luminosity bins - weakly con- 
strain the dependence of bubble size on host halo mass. 
Note also that the thermal noise term in the 21 cm vari- 
ance (see Equation 9) still contributes significantly for 
most fc-bins shown here, and so futuristic 21 cm surveys 
with more antennas and larger collecting areas than the 
MWA can further improve cross spectrum sensitivity. In 
particular, a future FFT telescope (Tegmark & Zaldar- 
riaga 2008) should boost the sensitivity compared to our 
estimates here (see Mao et al. 2008 for estimates of the 
auto spectrum sensitivity with an FFT telescope). 

8. CONCLUSIONS 
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In this paper, we considered the scientific return of fu- 
ture 21 cm-galaxy cross power spectrum measurements. 
A strong cross spectrum measurement ultimately re- 
quires detecting a sizable number of high redshift galax- 
ies over a large field of view, which presents a significant 
observational challenge. Nonetheless, we showed that a 
detection of the cross spectrum may be achieved in the 
near future by combining LOFAR and the Subaru survey 
for LAEs at z ~ 6.6. We estimate that a <~ 3 — a detec- 
tion is feasible, provided the IGM is > 20% neutral at 
this redshift, and that the Subaru survey's sky coverage 
is extended from 0.25 deg 2 to ~ 2 deg 2 . This detection 
would already be quite valuable, as it would help con- 
firm that the detected 21 cm signal comes from the high 
redshift IGM, and not from foreground contamination, 
which should mostly be uncorrelated with high redshift 
galaxies (Furlanetto & Lidz 2007). 

Futuristic galaxy surveys covering 100s of square de- 
grees on the sky, can be combined with the MWA, LO- 
FAR, and other 21 cm surveys, to move beyond a mere 
detection of the cross spectrum signal and map out its 
detailed scale dependence. The galaxy surveys required 
for these measurements are clearly very challenging, but 
rapid progress is being made in this direction as deep, 
widefield surveys are being designed to study baryonic 
acoustic oscillations and/or weak-lensing at high redshift 
(e.g., ADEPT, HETDEX 8 , CIP, and others). Another 
option is to sparsely sample the MWA or LOFAR fields, 
in order to capture the large-scale modes (Furlanetto & 
Lidz 2007). We have shown that the 21 cm-galaxy cross 
spectrum is a relatively direct tracer of bubble growth 
during rcionization. Measuring the turnover scale as a 
function of galaxy luminosity constrains the luminosity 
dependence of the characteristic bubble size. This infor- 
mation is difficult, or impossible, to obtain with the 21 
cm auto spectrum alone. In order to extract the most 

8 http://www.as.utexas.edu/hctdex/ 



information out of the cross spectrum, it should be com- 
bined with measurements of the galaxy auto spectrum 
and luminosity function, which will help to constrain the 
galaxy luminosity-halo mass correlation. 

A further interesting feature of the simulated signal 
is that the cross-correlation changes sign on large scales 
near the beginning of rcionization (Figure 4). At this 
early phase of reionization, our results may, however, 
be modified by spin temperature fluctuations, which we 
presently neglect. Future work should incorporate these 
fluctuations. If our signature holds up, the change in 
sign of the cross correlation would provide a very inter- 
esting observational indicator of the earliest phases of 
rcionization. Finally, we found that the 21 cm-galaxy 
cross power spectrum might provide an interesting ob- 
servational signature of scenarios where ionizing photons 
fail to escape from low mass halos. Provided galaxies 
in these low mass halos are detectable longward of the 
ionization edge, we expect the cross spectrum to change 
sign and turn positive on small scales. Generally speak- 
ing, the 21 cm-galaxy cross spectrum is a more direct 
tracer of the impact of galaxies on the surrounding IGM 
than the 21 cm auto spectrum. As such, it can poten- 
tially provide a wealth of information about the EoR and 
early structure formation. 
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